Text Detection and Extraction from Document Images using K-Nearest Neighbor Rule
نویسنده
چکیده
Text extraction and text line detection is the foundation of document image analysis. Since many years, a large number of text detection methods have been proposed, where these methods depend on convinced assumptions of documents with various font style, font size, distorted text, uneven lighting, complex background and low resolution. In this paper, reveals k-nearest neighbor rule as a generic text-line detection and text extraction approach that can be applied on a complex mail document images. The performance evaluation of transition map generation and it compares with other two models is presented in this paper. Experimental analysis shows that image based text Optical Character Recognition (OCR) method is to extract the text from the colorful image and detection of advertised mails is very efficient than that of the other existing methods. KeywordsText extraction, Text-line detection, KNN rule, and mail document image.
منابع مشابه
An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملAn Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملExtraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images
Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...
متن کاملEdge Detection Based On Nearest Neighbor Linear Cellular Automata Rules and Fuzzy Rule Based System
Edge Detection is an important task for sharpening the boundary of images to detect the region of interest. This paper applies a linear cellular automata rules and a Mamdani Fuzzy inference model for edge detection in both monochromatic and the RGB images. In the uniform cellular automata a transition matrix has been developed for edge detection. The Results have been compared to the ...
متن کاملExtraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images
Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...
متن کامل